PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GSMUA_Achr5P20050_001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family HD-ZIP
Protein Properties Length: 795aa    MW: 85630.9 Da    PI: 7.1209
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GSMUA_Achr5P20050_001genomeCIRADView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox672.6e-21113168156
                            TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            +++ +++t++q++eLe+lF+++++p++++r eL+++l L+ rqVk+WFqNrR+++k
  GSMUA_Achr5P20050_001 113 KKRYHRHTPQQIQELEALFKECPHPDEKQRMELSNRLCLEVRQVKFWFQNRRTQMK 168
                            688999***********************************************999 PP

2START117.61.7e-372854682165
                            HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                  START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv...........dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                            la+ a++elvk+a++eep+W  s   + g e+l  +e ++            + +ea r++gvv+ ++  lv +l+d + +W  +++ 
  GSMUA_Achr5P20050_001 285 LALVAMDELVKMAQLEEPLWIPSL--DAGRETLNHVEYDRCfsrcigprptgFVSEATRETGVVIINSSSLVDTLMDAA-RWADMFPs 369
                            67889*******************..66777777776666666777799999***************************.******** PP

                            ...EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEE CS
                  START  78 ...kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgili 154
                               +a+  +vissg      galqlm aelq+lsplvp R++ f+R+++ql++g w+ivdvS+d  +  p+ s+    +++lpSg+++
  GSMUA_Achr5P20050_001 370 viaRASPADVISSGlggtnnGALQLMHAELQVLSPLVPvREVRFLRFCKQLTEGAWAIVDVSIDGIRGTPSaSPAKTQCRRLPSGCVV 457
                            **9*******************************************************************99**************** PP

                            EEECTCEEEEE CS
                  START 155 epksnghskvt 165
                            ++++ g+skv+
  GSMUA_Achr5P20050_001 458 QDTPTGYSKVI 468
                            *********97 PP

3START32.61.8e-11495536164205
                            EEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                  START 164 vtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                            vtwveh++++++ ++ l+r+l+ sgla ga++wva lqrqc+
  GSMUA_Achr5P20050_001 495 VTWVEHAEYDEAAVPPLYRPLLLSGLALGARRWVASLQRQCQ 536
                            8****************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.24E-20101170IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.607.1E-22102172IPR009057Homeodomain-like
PROSITE profilePS5007117.297110170IPR001356Homeobox domain
SMARTSM003895.2E-18111174IPR001356Homeobox domain
CDDcd000869.85E-19113171No hitNo description
PfamPF000468.2E-19113168IPR001356Homeobox domain
PROSITE patternPS000270145168IPR017970Homeobox, conserved site
PROSITE profilePS5084839.207275540IPR002913START domain
CDDcd088752.31E-100281536No hitNo description
SuperFamilySSF559611.21E-21281463No hitNo description
SMARTSM002341.8E-29284537IPR002913START domain
PfamPF018525.0E-30285468IPR002913START domain
SuperFamilySSF559611.21E-21492538No hitNo description
PfamPF018529.4E-8495536IPR002913START domain
SuperFamilySSF559612.38E-20565788No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 795 aa     Download sequence    Send to blast
MSFGSLYDGG SGGGARLVGD MPFGNPAGAV SHPRLLSSSL HKSMFSSPGL SLALQTNLDA  60
HGVRNLASVV GGGGGQLDSA RRSKEDENGS RSGSDNLEGG SGDDLEQENP RKKKRYHRHT  120
PQQIQELEAL FKECPHPDEK QRMELSNRLC LEVRQVKFWF QNRRTQMKTQ MERHENMILR  180
QENDKLRAEN LSIREAMRNP MCCNCGGPAV LSEISLEEQH LRMDNARLKD DLDRVRALAG  240
KFLGKPVSAL AGSLPLPLPN SSLELAVGTN GGVDKSPDRF VFLELALVAM DELVKMAQLE  300
EPLWIPSLDA GRETLNHVEY DRCFSRCIGP RPTGFVSEAT RETGVVIINS SSLVDTLMDA  360
ARWADMFPSV IARASPADVI SSGLGGTNNG ALQLMHAELQ VLSPLVPVRE VRFLRFCKQL  420
TEGAWAIVDV SIDGIRGTPS ASPAKTQCRR LPSGCVVQDT PTGYSKVIDH CPSPLRSVSC  480
ISIGSLVLIV KAFGVTWVEH AEYDEAAVPP LYRPLLLSGL ALGARRWVAS LQRQCQSLAI  540
LMSSSLPPDD NTAITPSGRR SMLKLAQRMT DNFCAGVCAS SAREWKKLGG GINIGEDVRV  600
MTRQSVADPG EPPGVVLSAA TSVWLPVSPQ RLFDFLRNEQ LRSQWDILSN GGPMQEMAHI  660
AKGQNTGNAV SLLRASAMNA NQSSMLILQE TCTDTSGSLV VYAPVDIPAM HLVMSGGDSA  720
YVALLPSGFA VLPDGLPSGS VGGARKAGGS LLTVAFQILV NSQPTAKLTV ESVETVNNLI  780
SCTVQKIKAA LNCEP
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_009401771.10.0PREDICTED: homeobox-leucine zipper protein ROC5
RefseqXP_009401772.10.0PREDICTED: homeobox-leucine zipper protein ROC5
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLM0T0340.0M0T034_MUSAM; Uncharacterized protein
STRINGGSMUA_Achr5P20050_0010.0(Musa acuminata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP120337116
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein